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What is Claimed: 



1 1 . A method for accurately estimating a pose of a camera within a 

2 scene using a three dimension model of the scene, comprising the steps of: 

3 (a) generating an initial estimate of the pose; 

4 (b) selecting a set of relevant features of the three 

5 dimensional model based on the initial estimate of the pose; 

6 (c) creating a virtual projection of the set of relevant 

7 features responsive to the initial estimate of the pose; 

8 (d) matching a plurality of features of an image received 

9 from the camera to the virtual projection of the set of relevant 

10 features and measuring a plurality of matching errors; and 

11 (e) updating the estimate of the pose to reduce the 

12 plurality of matching errors. 

1 2. The method for claim 1, further comprising the step of: 

2 (f) repeating to steps (c), (d), and (e) using the updated 

3 estimate of the pose until the plurality of matching errors are less 

4 than predetermined matching criteria. 

1 3. The method of claim 1, further comprising the steps of: 

2 (f) adding the updates to the estimate of the pose made 

3 in step (e) to a total change value of the estimate of the pose; 

4 (g) if the total change value is less than predetermined 

5 pose change criteria, resetting the total change value and repeating 

6 to steps (b), (c), (d), (e) and (f) using the updated estimate of the 

7 pose; and 
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8 (h) repeating to steps (c), (d), (e), (f) and (g) using the 

9 updated estimate of the pose until the plurality of matching errors 
10 are less than predetermined matching criteria. 

1 4. The method of claim 1, wherein step (a) includes the step of 

2 comparing an image received from the camera to the three dimensional model of the 

3 scene to generate the estimate of the pose. 

1 5. The method of claim 1, wherein step (a) includes the step of using a 

2 preceding pose estimate of a preceding image received from the camera to generate the 

3 estimate of the pose. 

1 6. The method of claim 1, wherein step (b) includes the steps of: 

2 (bl) Z-buffering the three dimensional 

3 model based on the estimated pose to determine a set of visible 

4 features of the three dimensional model; and 

5 (b2) selecting the set of relevant features 

6 from the set of visible features. 

1 7. The method of claim 1, wherein step (b) includes the steps of: 

2 (bl) creating a set of features of the three 

3 dimensional model including a plurality of edges of at least one 

4 object represented in the three dimensional model, each edge 

5 having a dihedral angle of greater than a predetermined angle; 

6 (b2) selecting the set of relevant features 

7 from the set of features of the three dimensional model. 

1 8. The method of claim 1, wherein step (c) includes the step of 

2 removing a model feature from the set of relevant features if the model feature is less than 

3 a predetermined distance from a remaining relevant feature in the virtual projection. 

i 9. The method of claim 1, further comprising the step of: 




(f) perturbing the estimate of the pose and 

repeating to steps (c), (d), and (e) using the perturbed estimate of 
the pose until the plurality of matching errors are less than 
predetermined matching criteria. 

10. The method of claim 1, wherein step (d) includes the steps of: 

(dl) computing an oriented energy image of 

the image received from the camera; and 

(d2) integrating the oriented energy image 

along the virtual projection of the set of relevant features and 
measuring the plurality of matching errors. 

11. The method of claim 1, wherein step (d) includes the steps of: 

(dl) computing an oriented energy image of 

the image received from the camera; 

(d2) generating a set of scaled oriented energy 

images; 

(d3) selecting a scaled oriented energy image 

from the set of scaled oriented energy images; and 

(d4) integrating the selected scaled oriented 

energy image along the virtual projection of the set of relevant 
features and measuring the plurality of matching errors. 

12. A method for accurately estimating a pose of an image of a scene 
using a three dimension model of the scene, comprising the steps of: 

(a) performing a pyramid decomposition of the image to 

generate a set of pyramid levels of the image; 



(b) selecting one pyramid level from the set of pyramid 

levels; 
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7 (c) generating an estimate of the pose using the selected 

8 pyramid level; 

9 (d) selecting a set of relevant features of the three 

10 dimensional model based on the estimate of the pose; 

11 (e) creating a virtual projection of the set of relevant 

12 features responsive to the estimate of the pose; 

13 (f) matching a plurality of features of the selected 

14 pyramid level to the virtual projection of the set of relevant features 

15 and measuring a plurality of matching errors; 

16 (g) updating the estimate of the pose to reduce the 

17 plurality of matching errors; and 

18 (h) repeating to steps (e), (f), and (g) using the updated 

19 estimate of the pose until the plurality of matching errors are less 

20 than predetermined matching criteria which is responsive to the 

21 selected pyramid level. 

1 13. The method of claim 12, further comprising the steps of; , 

2 (i) removing the selected pyramid level from the set of 

3 pyramid levels; and 

4 (j) repeating to steps (b), (c), (d), (e), (f), (g), (h) and 

5 (i) until the set of pyramid levels contains no elements; 

6 wherein the selected level in step (b) has a lowest resolution level of the 

7 pyramid levels in the set of pyramid levels;. 

1 14. The method of claim 12, wherein step (e) includes the step of 

2 removing a model feature from the set of relevant features if the model feature is less than 

3 a predetermined distance from another relevant feature in the virtual projection, the 

4 predetermined distance being responsive to the selected pyramid level. 
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15. A method for refining a three dimensional model of a scene using 
an image of the scene taken by a camera having an unknown pose, comprising the steps 
of: 



(a) comparing the image to the three dimension model of 
the scene to generate an estimate of the pose; and 

(b) updating the three dimensional model of the scene 
based on data from the image and the estimate of the pose. 

16. The method claim 15, wherein step (b) includes the step of: 

(bl) projecting a plurality of color values of the image 

onto the three dimension model of the scene to update a texture of a 
surface of the three dimensional model. 



17. A method for accurately estimating a position of remote vehicle 
using a three dimension model and an image from a camera having a known orientation 
relative to the remote vehicle, comprising the steps of: 



(a) comparing the image to the three dimension model of 
the scene to generate an estimate of the pose; 

(b) selecting a set of relevant features of the three 
dimensional model based on the estimate of the pose; 

(c) matching a plurality of features of the image to the 
set of relevant features and measuring a plurality of matching 
errors; 

(d) updating the estimate of the pose based on the 
plurality of matching errors; and 



(e) determining the position of the remote vehicle based 

on the estimate of the pose and the orientation of the camera. 

18. The method of claim 17, further comprising the step of: 
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2 (f) updating the three dimensional model based on data 

3 from the image and the updated estimate of the pose. 

1 19. A method for refining a three dimension model of a scene 

2 containing an object using a plurality of images of the scene, each image including the 

3 object, comprising the steps of: 

4 (a) comparing a first image of the plurality of images to 

5 the three dimension model of the scene to generate an estimate of a 

6 first viewpoint corresponding to the first image; 

7 (b) comparing a second image of the plurality of images 

8 to the three dimension model of the scene to generate an estimate of 

9 a second viewpoint corresponding to the second image; 

10 (c) selecting a first set of relevant features of the three 

11 dimensional model based on the first viewpoint; 

12 (d) matching a plurality of first features of the first 

13 image to the first set of relevant features and measuring a plurality 

14 of first matching errors; 

15 (e) selecting a second set of relevant features of the three 

16 dimensional model based on the second viewpoint; 

17 (f) matching a plurality of second features of the second 

18 image to the second set of relevant features and measuring a 

19 plurality of second matching errors; and 

20 (g) updating a position estimate of the object within the 

21 three dimensional model of the scene based on the plurality of first 

22 matching errors and the plurality of second matching errors. 

l 20. The method of claim 19, further including the steps of: 



2 
3 



comparing the first image and the second image to generate relative pose 
constraints; and 
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updating the first viewpoint and the second viewpoint based on the relative 
pose constraints; 

21. The method of claim 20, wherein step of comparing the first image 
to the second image includes the steps of: 

matching the plurality of first features of the first image to the plurality of 
second features of the second feature to create a set of matched features; 



22. The method of claim 20, wherein the step of comparing the first 
image to the second image includes the steps of; 

generating a plurality of parallax measurements between the plurality of 
first features and the plurality of second features; and 

generating the relative pose constraints responsive to the plurality of 
parallax measurements. 

23. The method of claim 20, wherein step of comparing the first image 
to the second image includes the step of generating the relative pose constraints uses an 
epipolar geometry of the first image and the second image. 

24. A method for refining a three dimension model of a scene 
containing an object using a plurality of images of the scene, each image including the 
object, comprising the steps of: 



calculating optical flow for each matched feature; and 



generating the relative pose constraints responsive to the calculated optical 



flow. 



(a) 



selecting a subset of images from the plurality of 
images of the scene, the subset of frames containing at least two of 
the images; 



(b) determining a plurality of approximate relative 

viewpoints of the subset of images; 
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(c) comparing each image in the subset of images to the 
three dimensional model to generate a subset of estimated viewpoints 
corresponding to the subset of images, the subset of estimated 
viewpoints constrained by the plurality of approximate relative 
viewpoints; 

(d) selecting a set of relevant features of the three 
dimensional model corresponding to each estimated viewpoint; 

(e) matching a plurality of features of the each image in 
the subset of images to the corresponding set of relevant features 
and measuring a plurality of matching errors; and 

(f) updating a position estimate of the object within the 
three dimensional model of the scene based on the plurality of 
matching errors. 

25. The method of claim 24, further comprising the step of: 

(g) repeating to steps (c), (d), (e), and (f) using the 

updated position estimate of the object until the 
plurality of matching errors are less than 
predetermined matching criteria. 

26. The method of claim 24, wherein the subset of images is selected to 
have viewpoints within a predetermined range. 

27. A method for creating a hybrid three dimension model of a scene 
using a plurality of images of the scene, comprising the steps of: 

(a) creating a polyhedral model the scene including at 

least one polygonal surface; 



(b) determining a first set of images containing at least a 

section of a first polyjgonal surface; and 
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comparing a plurality of images selected from the 
first set of images to generate a local surface shape map 
corresponding to the first polygonal surface. 

The method of claim 27, wherein step (c) includes the steps of; 

(cl) calculating optical flow in regions corresponding to the first 
polygonal surface of pairs of the first set of images; and 

(c2) generating the local surface shape map responsive to the 
calculated optical flow. 

The method of claim 27, wherein step (c) includes the steps of; 

(cl) calculating parallax in regions corresponding to the first 
polygonal surface of pairs of the first set of images; and 

(c2) generating the local surface shape map responsive to the 
calculated parallax. 

The method of claim 27, further comprising the steps of: 

separating the local surface shape map of the first 
polygonal surface into a plurality of portions of not greater than the 
predetermined size; 

determining a first subset of images from the first set 
of images, each image of the first subset of images containing a first 
portion of the first polygonal surface; 

selecting at least one selected image of the first 
subset of images; 

projecting a corresponding section of the at least one 
selected image onto the first portion of the local surface shape map 
of the first polygonal surface of the hybrid three dimension model 
as a local color map; and 
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14 (h) repeating steps (e), (f), and (g) for each remaining 

15 portion of the local surface shape map of the first polygonal 

16 surface. 

1 31. The method of claim 30, further comprising the step of: 

2 (i) blending the plurality of local color maps 

3 corresponding to the plurality of portions of the local surface shape 

4 map of the first polygonal surface. 

1 32. A method for creating a textured three dimension model of a scene 

2 using a plurality of images of the scene, comprising the steps of: 

3 (a) creating a polyhedral model the scene including at 

4 least one polygonal surface; 

5 (b) identifying at least one portion of the one polygonal 

6 surface; 

7 (c) determining a first subset of images containing the at 

8 least one portion of the one polygonal surface; 

9 (d) selecting at least one selected image of the first 
io subset of images; 

n (e) projecting a corresponding section of each of the at 

12 least one selected image onto the at least one portion of the one 

13 polygonal surface of the textured three dimension model as a local 

14 color map; 

1 5 (f) repeating steps (c), (d), and (e) for each remaining 

16 portion of the one polygonal surface of the polyhedral model 



1 



33. 



The method of claim 32, wherein step (d) includes the steps of: 
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2 (dl) determining a resolution level for the at 

3 least one portion of the one polygonal surface of each image of 

4 the first subset of images; and 

5 (d2) selecting a selected image of the first 

6 subset of images corresponding to a largest resolution level for 

7 the at least one portion of the one polygonal surface. 

1 34. The method of claim 32, wherein: 

2 the at least one selected image contains at least two selected images; and 

3 step (e) includes the step of at least one of combining and selecting at least 

4 one color value of the at least one corresponding section of the at least two 

5 selected images to create the local color map of the at least one portion of the one 

6 polygonal surface. 

1 35. The method of claim 32, wherein the at least one portion includes a 

2 plurality of portions and the method further comprises the step of: 

3 (g) blending the plurality of local color maps 

4 corresponding to the plurality of portions of the one 

5 polygonal surface. 

1 36. The method of claim 35, wherein step (g) includes the step of: 

2 (gl) creating a plurality of resolution levels of 

3 the plurality of local color maps; and 

4 (g2) blending the plurality of local color maps 

5 over each resolution level. 

1 37. A method for creating a dynamic sequence of virtual images of a 

2 scene using a dynamically updated three dimension model of the scene, comprising the 

3 steps of: 
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(a) updating the three dimension model using a video 
sequence of images of the scene including the steps of; 

(al) determining a present viewpoint of a 

present image of the video sequence of images; 

(a2) determining a relevant portion of the three 

dimensional model corresponding to the present image of the 
video sequence of images; 

(a3) updating the relevant portion of the three 

dimensional model by projecting the present image onto the 
relevant portion of the three dimension model; 

(b) selecting a first virtual viewpoint of a first virtual 

image of the dynamic sequence of virtual images; 

(c) creating the first virtual image by projecting the 
dynamic three dimensional model onto the first virtual viewpoint; 
and 

(d) repeating steps (b) and (c) for each remaining virtual 
image of the dynamic sequence of virtual images. 

38. The method of claim 37, wherein step (a) further includes the step 

of; 

(a4) repeating step (a) using a remaining image 

of the sequence of images as the present 
image; 

39. The method of claim 37, wherein the selection of the virtual 
viewpoint in step (b) is responsive to at least one of; 

an external signal; 



a predetermined trajectory; 
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5 a motion of a selected pattern of the dynamic three-dimensional 

6 model; 

7 a motion of a selected object of the dynamic three-dimensional 

8 model; 

9 a motion of a selected pattern of a plurality of images of the 

10 sequence of video images; and 

11 a motion of a selected object of a plurality of images of the 

12 sequence of video images. 



g(J l 40. The method of claim 37, wherein the determination of the present 

III 2 viewpoint of the video sequence of images in step (a) is responsive to at least one of; 



3 an external signal; 

4 a predetermined trajectory; 

5 a motion of a selected pattern of the dynamic three-dimensional 

6 model; 

7 a motion of a selected object of the dynamic three-dimensional 

8 model; 

9 a motion of a selected pattern of a plurality of images of the 

10 sequence of video images; 

11 a motion of a selected object of a plurality of images of the 

12 sequence of video images; and 

13 the virtual viewpoint selected in step (b). 

1 41 . A computer readable medium adapted to instruct a general purpose 

2 computer to update a three dimensional model of a scene using the three dimensional 

3 model of the scene, an image received from a camera having an unknown pose, the 

4 method comprising the steps of: 
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(a) generating an estimate of the pose; 

(b) selecting a set of relevant features of the three 
dimensional model based on the estimate of the pose; 

(c) creating a virtual projection of the set of relevant 
features responsive to the estimate of the pose; 

(d) matching a plurality of features of an image received 
from the camera to the virtual projection of the set of relevant 
features and measuring a plurality of matching errors; 

(e) updating the estimate of the pose to reduce the 
plurality of matching errors; and 

(f) updating the three dimensional model of the scene 
based on data from the image and the estimate of the pose. 

42. An automatic three-dimensional model updating apparatus for 
accurately estimating a point of view of an image of a scene, relative to a three- 
dimensional model of the scene, and updating the three-dimensional model comprising: 

(a) estimating means for providing an estimate of the 
point of view of the image; 

(b) relevant feature selecting means for selecting a set of 
relevant features of the three dimensional model based on the 
estimate of the point of view; 

(c) virtual projection means for creating a virtual 
projection of the set of relevant features responsive to the estimate 
of the point of view; 
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(d) matching means for matching a plurality of features 

of the image to the virtual projection of the set of relevant features; 
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14 (e) measurement means for measuring a plurality of 

15 matching errors; 

16 (f) point of view refinement means for updating the 

17 estimate of the point of view to reduce the plurality of matching 

18 errors; and 

19 (g) model refinement means, responsive to the estimated 

20 point of view and to the image, for updating the three-dimensional 

21 model. 
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